Overview

Brought to you by YData

Dataset statistics

Number of variables23
Number of observations656
Missing cells3110
Missing cells (%)20.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory494.3 KiB
Average record size in memory771.5 B

Variable types

Text9
Numeric8
Categorical4
Boolean1
URL1

Alerts

exclude has constant value "True" Constant
Column has constant value "http://www.shadowandact.com/?p=23430" Constant
Audience Score is highly overall correlated with Rotten TomatoesHigh correlation
Box Office Average per Cinema is highly overall correlated with Domestic Gross and 2 other fieldsHigh correlation
Budget is highly overall correlated with Domestic Gross and 1 other fieldsHigh correlation
Domestic Gross is highly overall correlated with Box Office Average per Cinema and 3 other fieldsHigh correlation
Profit is highly overall correlated with Box Office Average per Cinema and 2 other fieldsHigh correlation
Rotten Tomatoes is highly overall correlated with Audience ScoreHigh correlation
Worldwide Gross is highly overall correlated with Box Office Average per Cinema and 3 other fieldsHigh correlation
id is highly overall correlated with yearHigh correlation
year is highly overall correlated with idHigh correlation
exclude has 528 (80.5%) missing values Missing
Lead Studio has 109 (16.6%) missing values Missing
Number of Theatres in Opening Weekend (US) has 44 (6.7%) missing values Missing
Box Office Average per Cinema has 54 (8.2%) missing values Missing
Foreign Gross has 45 (6.9%) missing values Missing
Budget has 11 (1.7%) missing values Missing
Proftitability has 11 (1.7%) missing values Missing
Oscar has 640 (97.6%) missing values Missing
Bafta has 644 (98.2%) missing values Missing
Source has 351 (53.5%) missing values Missing
Column has 655 (99.8%) missing values Missing
id is uniformly distributed Uniform
movies_id has unique values Unique
id has unique values Unique

Reproduction

Analysis started2025-08-25 10:54:17.577891
Analysis finished2025-08-25 10:54:19.830213
Duration2.25 seconds
Software versionydata-profiling vv4.16.1
Download configurationconfig.json

Variables

movies_id
Text

Unique 

Distinct656
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size38.6 KiB
2025-08-25T12:54:19.901387image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters7216
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique656 ?
Unique (%)100.0%

Sample

1st rowmovies-0000
2nd rowmovies-0001
3rd rowmovies-0002
4th rowmovies-0003
5th rowmovies-0004
ValueCountFrequency (%)
movies-0000 1
 
0.2%
movies-0020 1
 
0.2%
movies-0009 1
 
0.2%
movies-0002 1
 
0.2%
movies-0003 1
 
0.2%
movies-0004 1
 
0.2%
movies-0005 1
 
0.2%
movies-0006 1
 
0.2%
movies-0007 1
 
0.2%
movies-0008 1
 
0.2%
Other values (646) 646
98.5%
2025-08-25T12:54:19.996458image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 892
12.4%
m 656
9.1%
v 656
9.1%
i 656
9.1%
e 656
9.1%
s 656
9.1%
- 656
9.1%
o 656
9.1%
1 236
 
3.3%
4 236
 
3.3%
Other values (7) 1260
17.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7216
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 892
12.4%
m 656
9.1%
v 656
9.1%
i 656
9.1%
e 656
9.1%
s 656
9.1%
- 656
9.1%
o 656
9.1%
1 236
 
3.3%
4 236
 
3.3%
Other values (7) 1260
17.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7216
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 892
12.4%
m 656
9.1%
v 656
9.1%
i 656
9.1%
e 656
9.1%
s 656
9.1%
- 656
9.1%
o 656
9.1%
1 236
 
3.3%
4 236
 
3.3%
Other values (7) 1260
17.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7216
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 892
12.4%
m 656
9.1%
v 656
9.1%
i 656
9.1%
e 656
9.1%
s 656
9.1%
- 656
9.1%
o 656
9.1%
1 236
 
3.3%
4 236
 
3.3%
Other values (7) 1260
17.5%

id
Real number (ℝ)

High correlation  Uniform  Unique 

Distinct656
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean328.5
Minimum1
Maximum656
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2025-08-25T12:54:20.029943image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile33.75
Q1164.75
median328.5
Q3492.25
95-th percentile623.25
Maximum656
Range655
Interquartile range (IQR)327.5

Descriptive statistics

Standard deviation189.51517
Coefficient of variation (CV)0.57691072
Kurtosis-1.2
Mean328.5
Median Absolute Deviation (MAD)164
Skewness0
Sum215496
Variance35916
MonotonicityStrictly increasing
2025-08-25T12:54:20.062568image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
432 1
 
0.2%
434 1
 
0.2%
435 1
 
0.2%
436 1
 
0.2%
437 1
 
0.2%
438 1
 
0.2%
439 1
 
0.2%
440 1
 
0.2%
441 1
 
0.2%
Other values (646) 646
98.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
656 1
0.2%
655 1
0.2%
654 1
0.2%
653 1
0.2%
652 1
0.2%
651 1
0.2%
650 1
0.2%
649 1
0.2%
648 1
0.2%
647 1
0.2%

year
Categorical

High correlation 

Distinct5
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
2008
149 
2010
146 
2009
136 
2011
134 
2007
91 

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters2624
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2010
2nd row2010
3rd row2010
4th row2010
5th row2010

Common Values

ValueCountFrequency (%)
2008 149
22.7%
2010 146
22.3%
2009 136
20.7%
2011 134
20.4%
2007 91
13.9%

Length

2025-08-25T12:54:20.090482image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-08-25T12:54:20.117857image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
2008 149
22.7%
2010 146
22.3%
2009 136
20.7%
2011 134
20.4%
2007 91
13.9%

Most occurring characters

ValueCountFrequency (%)
0 1178
44.9%
2 656
25.0%
1 414
 
15.8%
8 149
 
5.7%
9 136
 
5.2%
7 91
 
3.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2624
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 1178
44.9%
2 656
25.0%
1 414
 
15.8%
8 149
 
5.7%
9 136
 
5.2%
7 91
 
3.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2624
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 1178
44.9%
2 656
25.0%
1 414
 
15.8%
8 149
 
5.7%
9 136
 
5.2%
7 91
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2624
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 1178
44.9%
2 656
25.0%
1 414
 
15.8%
8 149
 
5.7%
9 136
 
5.2%
7 91
 
3.5%

exclude
Boolean

Constant  Missing 

Distinct1
Distinct (%)0.8%
Missing528
Missing (%)80.5%
Memory size1.4 KiB
True
128 
(Missing)
528 
ValueCountFrequency (%)
True 128
 
19.5%
(Missing) 528
80.5%
2025-08-25T12:54:20.133830image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Film
Text

Distinct654
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size41.4 KiB
2025-08-25T12:54:20.232899image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length56
Median length40
Mean length15.489329
Min length1

Characters and Unicode

Total characters10161
Distinct characters74
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique652 ?
Unique (%)99.4%

Sample

1st row127 Hours
2nd rowA Nightmare on Elm Street
3rd rowAlice in Wonderland
4th rowAll About Steve
5th rowAll Good Things
ValueCountFrequency (%)
the 205
 
11.3%
of 59
 
3.3%
and 25
 
1.4%
a 17
 
0.9%
in 16
 
0.9%
2 15
 
0.8%
to 11
 
0.6%
love 10
 
0.6%
you 10
 
0.6%
i 10
 
0.6%
Other values (1051) 1436
79.2%
2025-08-25T12:54:20.377445image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1295
 
12.7%
e 1009
 
9.9%
a 611
 
6.0%
r 588
 
5.8%
o 585
 
5.8%
n 533
 
5.2%
t 509
 
5.0%
i 508
 
5.0%
s 406
 
4.0%
h 395
 
3.9%
Other values (64) 3722
36.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 10161
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1295
 
12.7%
e 1009
 
9.9%
a 611
 
6.0%
r 588
 
5.8%
o 585
 
5.8%
n 533
 
5.2%
t 509
 
5.0%
i 508
 
5.0%
s 406
 
4.0%
h 395
 
3.9%
Other values (64) 3722
36.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 10161
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1295
 
12.7%
e 1009
 
9.9%
a 611
 
6.0%
r 588
 
5.8%
o 585
 
5.8%
n 533
 
5.2%
t 509
 
5.0%
i 508
 
5.0%
s 406
 
4.0%
h 395
 
3.9%
Other values (64) 3722
36.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 10161
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1295
 
12.7%
e 1009
 
9.9%
a 611
 
6.0%
r 588
 
5.8%
o 585
 
5.8%
n 533
 
5.2%
t 509
 
5.0%
i 508
 
5.0%
s 406
 
4.0%
h 395
 
3.9%
Other values (64) 3722
36.6%

Lead Studio
Categorical

Missing 

Distinct47
Distinct (%)8.6%
Missing109
Missing (%)16.6%
Memory size37.5 KiB
Independent
110 
Paramount
55 
Warner Bros.
53 
Universal
46 
Sony
42 
Other values (42)
241 

Length

Max length25
Median length24
Mean length9.8391225
Min length3

Characters and Unicode

Total characters5382
Distinct characters47
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)3.8%

Sample

1st rowIndependent
2nd rowWarner Bros.
3rd rowDisney
4th rowIndependent
5th rowIndependent

Common Values

ValueCountFrequency (%)
Independent 110
16.8%
Paramount 55
8.4%
Warner Bros. 53
8.1%
Universal 46
7.0%
Sony 42
 
6.4%
Fox 41
 
6.2%
Disney 35
 
5.3%
Independant 32
 
4.9%
Lionsgate 19
 
2.9%
Relativity Media 15
 
2.3%
Other values (37) 99
15.1%
(Missing) 109
16.6%

Length

2025-08-25T12:54:20.407613image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
independent 111
15.7%
warner 64
 
9.0%
bros 64
 
9.0%
paramount 55
 
7.8%
fox 50
 
7.1%
universal 46
 
6.5%
sony 43
 
6.1%
disney 35
 
4.9%
independant 32
 
4.5%
relativity 20
 
2.8%
Other values (46) 189
26.7%

Most occurring characters

ValueCountFrequency (%)
n 798
14.8%
e 704
13.1%
a 368
 
6.8%
t 348
 
6.5%
r 346
 
6.4%
d 312
 
5.8%
o 275
 
5.1%
i 251
 
4.7%
s 215
 
4.0%
p 172
 
3.2%
Other values (37) 1593
29.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 5382
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
n 798
14.8%
e 704
13.1%
a 368
 
6.8%
t 348
 
6.5%
r 346
 
6.4%
d 312
 
5.8%
o 275
 
5.1%
i 251
 
4.7%
s 215
 
4.0%
p 172
 
3.2%
Other values (37) 1593
29.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5382
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
n 798
14.8%
e 704
13.1%
a 368
 
6.8%
t 348
 
6.5%
r 346
 
6.4%
d 312
 
5.8%
o 275
 
5.1%
i 251
 
4.7%
s 215
 
4.0%
p 172
 
3.2%
Other values (37) 1593
29.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5382
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
n 798
14.8%
e 704
13.1%
a 368
 
6.8%
t 348
 
6.5%
r 346
 
6.4%
d 312
 
5.8%
o 275
 
5.1%
i 251
 
4.7%
s 215
 
4.0%
p 172
 
3.2%
Other values (37) 1593
29.6%

Rotten Tomatoes
Real number (ℝ)

High correlation 

Distinct100
Distinct (%)15.3%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean48.99084
Minimum0
Maximum99
Zeros1
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2025-08-25T12:54:20.480324image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile9
Q126
median48
Q372
95-th percentile92
Maximum99
Range99
Interquartile range (IQR)46

Descriptive statistics

Standard deviation26.659191
Coefficient of variation (CV)0.54416685
Kurtosis-1.1594447
Mean48.99084
Median Absolute Deviation (MAD)23
Skewness0.084812409
Sum32089
Variance710.71245
MonotonicityNot monotonic
2025-08-25T12:54:20.511840image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36 15
 
2.3%
14 15
 
2.3%
27 15
 
2.3%
52 13
 
2.0%
78 12
 
1.8%
19 11
 
1.7%
68 10
 
1.5%
46 10
 
1.5%
38 10
 
1.5%
13 10
 
1.5%
Other values (90) 534
81.4%
ValueCountFrequency (%)
0 1
 
0.2%
1 1
 
0.2%
2 3
 
0.5%
3 2
 
0.3%
4 5
0.8%
5 2
 
0.3%
6 5
0.8%
7 6
0.9%
8 4
0.6%
9 8
1.2%
ValueCountFrequency (%)
99 1
 
0.2%
98 2
 
0.3%
97 3
 
0.5%
96 4
0.6%
95 3
 
0.5%
94 9
1.4%
93 9
1.4%
92 6
0.9%
91 5
0.8%
90 5
0.8%

Audience Score
Real number (ℝ)

High correlation 

Distinct73
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59.910061
Minimum19
Maximum96
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2025-08-25T12:54:20.541005image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum19
5-th percentile33
Q147
median59
Q373
95-th percentile87
Maximum96
Range77
Interquartile range (IQR)26

Descriptive statistics

Standard deviation16.579657
Coefficient of variation (CV)0.27674245
Kurtosis-0.78875615
Mean59.910061
Median Absolute Deviation (MAD)13
Skewness-0.015848479
Sum39301
Variance274.88503
MonotonicityNot monotonic
2025-08-25T12:54:20.572505image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
72 21
 
3.2%
50 20
 
3.0%
56 19
 
2.9%
73 19
 
2.9%
57 19
 
2.9%
61 16
 
2.4%
47 16
 
2.4%
55 15
 
2.3%
43 15
 
2.3%
48 15
 
2.3%
Other values (63) 481
73.3%
ValueCountFrequency (%)
19 1
 
0.2%
20 1
 
0.2%
22 1
 
0.2%
24 2
 
0.3%
25 1
 
0.2%
26 2
 
0.3%
27 3
0.5%
28 5
0.8%
29 4
0.6%
31 7
1.1%
ValueCountFrequency (%)
96 1
 
0.2%
93 4
0.6%
92 2
 
0.3%
91 5
0.8%
90 6
0.9%
89 6
0.9%
88 5
0.8%
87 9
1.4%
86 6
0.9%
85 5
0.8%

Story
Categorical

Distinct22
Distinct (%)3.4%
Missing2
Missing (%)0.3%
Memory size36.9 KiB
Comedy
87 
Love
73 
Monster Force
63 
Quest
59 
Rivalry
38 
Other values (17)
334 

Length

Max length18
Median length14
Mean length8.4266055
Min length4

Characters and Unicode

Total characters5511
Distinct characters37
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEscape
2nd rowMonster Force
3rd rowJourney And Return
4th rowComedy
5th rowThe Riddle

Common Values

ValueCountFrequency (%)
Comedy 87
13.3%
Love 73
 
11.1%
Monster Force 63
 
9.6%
Quest 59
 
9.0%
Rivalry 38
 
5.8%
Discovery 36
 
5.5%
Pursuit 32
 
4.9%
Transformation 27
 
4.1%
Revenge 26
 
4.0%
Maturation 26
 
4.0%
Other values (12) 187
28.5%

Length

2025-08-25T12:54:20.605224image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
comedy 87
 
10.3%
love 73
 
8.7%
monster 63
 
7.5%
force 63
 
7.5%
quest 59
 
7.0%
rivalry 38
 
4.5%
discovery 36
 
4.3%
pursuit 32
 
3.8%
transformation 27
 
3.2%
revenge 26
 
3.1%
Other values (22) 339
40.2%

Most occurring characters

ValueCountFrequency (%)
e 729
 
13.2%
o 481
 
8.7%
r 430
 
7.8%
t 325
 
5.9%
s 322
 
5.8%
n 267
 
4.8%
i 259
 
4.7%
a 235
 
4.3%
u 232
 
4.2%
d 209
 
3.8%
Other values (27) 2022
36.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 5511
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 729
 
13.2%
o 481
 
8.7%
r 430
 
7.8%
t 325
 
5.9%
s 322
 
5.8%
n 267
 
4.8%
i 259
 
4.7%
a 235
 
4.3%
u 232
 
4.2%
d 209
 
3.8%
Other values (27) 2022
36.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5511
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 729
 
13.2%
o 481
 
8.7%
r 430
 
7.8%
t 325
 
5.9%
s 322
 
5.8%
n 267
 
4.8%
i 259
 
4.7%
a 235
 
4.3%
u 232
 
4.2%
d 209
 
3.8%
Other values (27) 2022
36.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5511
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 729
 
13.2%
o 481
 
8.7%
r 430
 
7.8%
t 325
 
5.9%
s 322
 
5.8%
n 267
 
4.8%
i 259
 
4.7%
a 235
 
4.3%
u 232
 
4.2%
d 209
 
3.8%
Other values (27) 2022
36.7%

Genre
Categorical

Distinct12
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size35.6 KiB
Comedy
175 
Action
159 
Drama
98 
Horror
50 
Animation
49 
Other values (7)
125 

Length

Max length11
Median length6
Mean length6.4314024
Min length5

Characters and Unicode

Total characters4219
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAdventure
2nd rowHorror
3rd rowAdventure
4th rowComedy
5th rowDrama

Common Values

ValueCountFrequency (%)
Comedy 175
26.7%
Action 159
24.2%
Drama 98
14.9%
Horror 50
 
7.6%
Animation 49
 
7.5%
Thriller 33
 
5.0%
Adventure 31
 
4.7%
Romance 22
 
3.4%
Crime 16
 
2.4%
Biography 13
 
2.0%
Other values (2) 10
 
1.5%

Length

2025-08-25T12:54:20.629514image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
comedy 175
26.7%
action 159
24.2%
drama 98
14.9%
horror 50
 
7.6%
animation 49
 
7.5%
thriller 33
 
5.0%
adventure 31
 
4.7%
romance 22
 
3.4%
crime 16
 
2.4%
biography 13
 
2.0%
Other values (2) 10
 
1.5%

Most occurring characters

ValueCountFrequency (%)
o 523
12.4%
r 384
 
9.1%
m 365
 
8.7%
i 319
 
7.6%
e 318
 
7.5%
n 315
 
7.5%
a 285
 
6.8%
t 249
 
5.9%
A 239
 
5.7%
d 206
 
4.9%
Other values (16) 1016
24.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4219
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 523
12.4%
r 384
 
9.1%
m 365
 
8.7%
i 319
 
7.6%
e 318
 
7.5%
n 315
 
7.5%
a 285
 
6.8%
t 249
 
5.9%
A 239
 
5.7%
d 206
 
4.9%
Other values (16) 1016
24.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4219
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 523
12.4%
r 384
 
9.1%
m 365
 
8.7%
i 319
 
7.6%
e 318
 
7.5%
n 315
 
7.5%
a 285
 
6.8%
t 249
 
5.9%
A 239
 
5.7%
d 206
 
4.9%
Other values (16) 1016
24.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4219
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 523
12.4%
r 384
 
9.1%
m 365
 
8.7%
i 319
 
7.6%
e 318
 
7.5%
n 315
 
7.5%
a 285
 
6.8%
t 249
 
5.9%
A 239
 
5.7%
d 206
 
4.9%
Other values (16) 1016
24.1%
Distinct533
Distinct (%)87.1%
Missing44
Missing (%)6.7%
Memory size33.1 KiB
2025-08-25T12:54:20.740471image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.8513072
Min length1

Characters and Unicode

Total characters2357
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique468 ?
Unique (%)76.5%

Sample

1st row916
2nd row3332
3rd row3728
4th row2251
5th row2
ValueCountFrequency (%)
4 8
 
1.3%
3175 3
 
0.5%
2756 3
 
0.5%
2534 3
 
0.5%
2511 3
 
0.5%
3606 3
 
0.5%
3030 3
 
0.5%
2470 3
 
0.5%
3121 3
 
0.5%
3376 2
 
0.3%
Other values (523) 578
94.4%
2025-08-25T12:54:20.879990image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 440
18.7%
3 375
15.9%
1 261
11.1%
5 239
10.1%
4 209
8.9%
0 209
8.9%
6 188
8.0%
7 153
 
6.5%
8 144
 
6.1%
9 138
 
5.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2357
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 440
18.7%
3 375
15.9%
1 261
11.1%
5 239
10.1%
4 209
8.9%
0 209
8.9%
6 188
8.0%
7 153
 
6.5%
8 144
 
6.1%
9 138
 
5.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2357
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 440
18.7%
3 375
15.9%
1 261
11.1%
5 239
10.1%
4 209
8.9%
0 209
8.9%
6 188
8.0%
7 153
 
6.5%
8 144
 
6.1%
9 138
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2357
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 440
18.7%
3 375
15.9%
1 261
11.1%
5 239
10.1%
4 209
8.9%
0 209
8.9%
6 188
8.0%
7 153
 
6.5%
8 144
 
6.1%
9 138
 
5.9%

Box Office Average per Cinema
Real number (ℝ)

High correlation  Missing 

Distinct579
Distinct (%)96.2%
Missing54
Missing (%)8.2%
Infinite0
Infinite (%)0.0%
Mean8172.3322
Minimum1052
Maximum93230
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2025-08-25T12:54:20.911840image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1052
5-th percentile2243.8
Q13785
median5943
Q39741
95-th percentile22282.2
Maximum93230
Range92178
Interquartile range (IQR)5956

Descriptive statistics

Standard deviation7865.2285
Coefficient of variation (CV)0.96242153
Kurtosis29.296118
Mean8172.3322
Median Absolute Deviation (MAD)2567.5
Skewness4.1680711
Sum4919744
Variance61861820
MonotonicityNot monotonic
2025-08-25T12:54:20.942894image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4761 3
 
0.5%
3132 2
 
0.3%
4605 2
 
0.3%
1806 2
 
0.3%
9743 2
 
0.3%
3610 2
 
0.3%
4405 2
 
0.3%
7655 2
 
0.3%
5427 2
 
0.3%
3300 2
 
0.3%
Other values (569) 581
88.6%
(Missing) 54
 
8.2%
ValueCountFrequency (%)
1052 1
0.2%
1354 1
0.2%
1459 1
0.2%
1490 1
0.2%
1513 1
0.2%
1559 1
0.2%
1575 1
0.2%
1585 1
0.2%
1625 1
0.2%
1703 1
0.2%
ValueCountFrequency (%)
93230 1
0.2%
61777 1
0.2%
45429 1
0.2%
41890 1
0.2%
41038 1
0.2%
40385 1
0.2%
39384 1
0.2%
38672 1
0.2%
36338 1
0.2%
36283 1
0.2%

Domestic Gross
Real number (ℝ)

High correlation 

Distinct623
Distinct (%)95.8%
Missing6
Missing (%)0.9%
Infinite0
Infinite (%)0.0%
Mean67.996292
Minimum0
Maximum743.8
Zeros3
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2025-08-25T12:54:20.972325image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4.427
Q120.22
median40.195
Q388.8675
95-th percentile222.061
Maximum743.8
Range743.8
Interquartile range (IQR)68.6475

Descriptive statistics

Standard deviation77.446419
Coefficient of variation (CV)1.13898
Kurtosis13.121705
Mean67.996292
Median Absolute Deviation (MAD)26.585
Skewness2.8472878
Sum44197.59
Variance5997.9478
MonotonicityNot monotonic
2025-08-25T12:54:21.004022image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31.7 3
 
0.5%
0 3
 
0.5%
25.93 2
 
0.3%
22.5 2
 
0.3%
24.54 2
 
0.3%
16 2
 
0.3%
9.2 2
 
0.3%
10.3 2
 
0.3%
0.02 2
 
0.3%
0.54 2
 
0.3%
Other values (613) 628
95.7%
(Missing) 6
 
0.9%
ValueCountFrequency (%)
0 3
0.5%
0.02 2
0.3%
0.03 2
0.3%
0.11 1
 
0.2%
0.22 1
 
0.2%
0.38 1
 
0.2%
0.41 1
 
0.2%
0.54 2
0.3%
0.58 1
 
0.2%
0.97 1
 
0.2%
ValueCountFrequency (%)
743.8 1
0.2%
530.92 1
0.2%
415 1
0.2%
402.1 1
0.2%
381.01 1
0.2%
352.39 1
0.2%
336.53 1
0.2%
334.19 1
0.2%
322.72 1
0.2%
319.25 1
0.2%

Foreign Gross
Text

Missing 

Distinct579
Distinct (%)94.8%
Missing45
Missing (%)6.9%
Memory size33.4 KiB
2025-08-25T12:54:21.120332image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.3698854
Min length1

Characters and Unicode

Total characters2670
Distinct characters12
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique557 ?
Unique (%)91.2%

Sample

1st row42.4
2nd row52.59
3rd row690.2
4th row6.26
5th row0.062
ValueCountFrequency (%)
10
 
1.6%
0 6
 
1.0%
11.2 2
 
0.3%
14.93 2
 
0.3%
0.01 2
 
0.3%
27.9 2
 
0.3%
0.87 2
 
0.3%
74.58 2
 
0.3%
193 2
 
0.3%
16.8 2
 
0.3%
Other values (568) 579
94.8%
2025-08-25T12:54:21.259817image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 543
20.3%
1 331
12.4%
2 255
9.6%
3 241
9.0%
4 224
8.4%
5 200
 
7.5%
7 194
 
7.3%
6 191
 
7.2%
9 174
 
6.5%
8 157
 
5.9%
Other values (2) 160
 
6.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2670
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
. 543
20.3%
1 331
12.4%
2 255
9.6%
3 241
9.0%
4 224
8.4%
5 200
 
7.5%
7 194
 
7.3%
6 191
 
7.2%
9 174
 
6.5%
8 157
 
5.9%
Other values (2) 160
 
6.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2670
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
. 543
20.3%
1 331
12.4%
2 255
9.6%
3 241
9.0%
4 224
8.4%
5 200
 
7.5%
7 194
 
7.3%
6 191
 
7.2%
9 174
 
6.5%
8 157
 
5.9%
Other values (2) 160
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2670
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
. 543
20.3%
1 331
12.4%
2 255
9.6%
3 241
9.0%
4 224
8.4%
5 200
 
7.5%
7 194
 
7.3%
6 191
 
7.2%
9 174
 
6.5%
8 157
 
5.9%
Other values (2) 160
 
6.0%

Worldwide Gross
Real number (ℝ)

High correlation 

Distinct634
Distinct (%)97.2%
Missing4
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean151.07402
Minimum0.03
Maximum2712.85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2025-08-25T12:54:21.290832image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0.03
5-th percentile8.6585
Q134.815
median75.7
Q3176.1825
95-th percentile576.367
Maximum2712.85
Range2712.82
Interquartile range (IQR)141.3675

Descriptive statistics

Standard deviation216.96465
Coefficient of variation (CV)1.436148
Kurtosis33.960663
Mean151.07402
Median Absolute Deviation (MAD)53.495
Skewness4.3536168
Sum98500.26
Variance47073.658
MonotonicityNot monotonic
2025-08-25T12:54:21.321554image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.03 2
 
0.3%
93.4 2
 
0.3%
63.8 2
 
0.3%
73.8 2
 
0.3%
183.3 2
 
0.3%
1.1 2
 
0.3%
29 2
 
0.3%
73.4 2
 
0.3%
29.37 2
 
0.3%
34.5 2
 
0.3%
Other values (624) 632
96.3%
(Missing) 4
 
0.6%
ValueCountFrequency (%)
0.03 2
0.3%
0.38 1
0.2%
0.41 1
0.2%
0.55 1
0.2%
0.64 1
0.2%
1.07 1
0.2%
1.1 2
0.3%
1.32 1
0.2%
1.57 1
0.2%
2.71 1
0.2%
ValueCountFrequency (%)
2712.85 1
0.2%
1328.11 1
0.2%
1123.2 1
0.2%
1063.16 1
0.2%
1043.87 1
0.2%
1024.39 1
0.2%
996.9 1
0.2%
961 1
0.2%
955.41 1
0.2%
939.88 1
0.2%

Budget
Real number (ℝ)

High correlation  Missing 

Distinct135
Distinct (%)20.9%
Missing11
Missing (%)1.7%
Infinite0
Infinite (%)0.0%
Mean54.096147
Minimum0
Maximum300
Zeros1
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2025-08-25T12:54:21.357817image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6
Q120
median35
Q370
95-th percentile166.6
Maximum300
Range300
Interquartile range (IQR)50

Descriptive statistics

Standard deviation51.370097
Coefficient of variation (CV)0.94960732
Kurtosis2.7571938
Mean54.096147
Median Absolute Deviation (MAD)20
Skewness1.706458
Sum34892.015
Variance2638.8869
MonotonicityNot monotonic
2025-08-25T12:54:21.432998image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 49
 
7.5%
30 33
 
5.0%
25 29
 
4.4%
40 28
 
4.3%
35 25
 
3.8%
15 24
 
3.7%
60 21
 
3.2%
150 20
 
3.0%
50 19
 
2.9%
80 18
 
2.7%
Other values (125) 379
57.8%
ValueCountFrequency (%)
0 1
 
0.2%
0.015 1
 
0.2%
0.2 1
 
0.2%
0.5 1
 
0.2%
1.5 1
 
0.2%
1.7 1
 
0.2%
1.8 1
 
0.2%
2 3
0.5%
2.5 1
 
0.2%
2.6 1
 
0.2%
ValueCountFrequency (%)
300 1
 
0.2%
260 1
 
0.2%
258 1
 
0.2%
250 2
 
0.3%
237 1
 
0.2%
230 1
 
0.2%
210 1
 
0.2%
200 11
1.7%
195 1
 
0.2%
190 1
 
0.2%

Profit
Real number (ℝ)

High correlation 

Distinct637
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean96.963788
Minimum-111.01
Maximum2475.85
Zeros4
Zeros (%)0.6%
Negative107
Negative (%)16.3%
Memory size5.3 KiB
2025-08-25T12:54:21.507144image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum-111.01
5-th percentile-18.1175
Q16.55
median39.925
Q3117.9225
95-th percentile438.6375
Maximum2475.85
Range2586.86
Interquartile range (IQR)111.3725

Descriptive statistics

Standard deviation182.58967
Coefficient of variation (CV)1.8830708
Kurtosis48.681841
Mean96.963788
Median Absolute Deviation (MAD)39.27
Skewness5.2178514
Sum63608.245
Variance33338.989
MonotonicityNot monotonic
2025-08-25T12:54:21.537166image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13.5 4
 
0.6%
0 4
 
0.6%
-6.1 2
 
0.3%
38.3 2
 
0.3%
14 2
 
0.3%
11.2 2
 
0.3%
3.3 2
 
0.3%
1.12 2
 
0.3%
26.8 2
 
0.3%
-2.9 2
 
0.3%
Other values (627) 632
96.3%
ValueCountFrequency (%)
-111.01 1
0.2%
-88.5 1
0.2%
-58.8 1
0.2%
-50.5 1
0.2%
-50 1
0.2%
-44.83 1
0.2%
-41.2 1
0.2%
-39.83 1
0.2%
-37.66 1
0.2%
-36.1 1
0.2%
ValueCountFrequency (%)
2475.85 1
0.2%
1203.11 1
0.2%
928.2 1
0.2%
863.16 1
0.2%
830.41 1
0.2%
824.39 1
0.2%
811.9 1
0.2%
794.5 1
0.2%
793.87 1
0.2%
789.88 1
0.2%

Proftitability
Text

Missing 

Distinct583
Distinct (%)90.4%
Missing11
Missing (%)1.7%
Memory size34.8 KiB
2025-08-25T12:54:21.642803image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.5643411
Min length1

Characters and Unicode

Total characters3589
Distinct characters20
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique534 ?
Unique (%)82.8%

Sample

1st row337.39%
2nd row330.46%
3rd row512.20%
4th row267.53%
5th row3.20%
ValueCountFrequency (%)
1.93 5
 
0.8%
2.1 4
 
0.6%
0.9 3
 
0.5%
6.83 3
 
0.5%
1.47 3
 
0.5%
1.3 3
 
0.5%
1.07 3
 
0.5%
2.09 3
 
0.5%
2.03 3
 
0.5%
1.73 3
 
0.5%
Other values (572) 612
94.9%
2025-08-25T12:54:21.775407image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 642
17.9%
1 374
10.4%
% 360
10.0%
2 338
9.4%
3 313
8.7%
0 269
7.5%
4 233
 
6.5%
5 223
 
6.2%
8 217
 
6.0%
7 214
 
6.0%
Other values (10) 406
11.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3589
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
. 642
17.9%
1 374
10.4%
% 360
10.0%
2 338
9.4%
3 313
8.7%
0 269
7.5%
4 233
 
6.5%
5 223
 
6.2%
8 217
 
6.0%
7 214
 
6.0%
Other values (10) 406
11.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3589
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
. 642
17.9%
1 374
10.4%
% 360
10.0%
2 338
9.4%
3 313
8.7%
0 269
7.5%
4 233
 
6.5%
5 223
 
6.2%
8 217
 
6.0%
7 214
 
6.0%
Other values (10) 406
11.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3589
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
. 642
17.9%
1 374
10.4%
% 360
10.0%
2 338
9.4%
3 313
8.7%
0 269
7.5%
4 233
 
6.5%
5 223
 
6.2%
8 217
 
6.0%
7 214
 
6.0%
Other values (10) 406
11.3%
Distinct484
Distinct (%)74.3%
Missing5
Missing (%)0.8%
Memory size33.9 KiB
2025-08-25T12:54:21.895765image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length6
Median length4
Mean length3.8141321
Min length1

Characters and Unicode

Total characters2483
Distinct characters12
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique374 ?
Unique (%)57.5%

Sample

1st row0.26
2nd row32.9
3rd row116.1
4th row11.2
5th row0.037
ValueCountFrequency (%)
14 5
 
0.8%
17.6 5
 
0.8%
6.9 5
 
0.8%
5.4 4
 
0.6%
10 4
 
0.6%
11.2 4
 
0.6%
10.1 4
 
0.6%
7.6 4
 
0.6%
4.9 4
 
0.6%
10.6 4
 
0.6%
Other values (474) 608
93.4%
2025-08-25T12:54:22.043082image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 587
23.6%
1 371
14.9%
2 229
 
9.2%
3 213
 
8.6%
4 188
 
7.6%
5 177
 
7.1%
6 164
 
6.6%
0 162
 
6.5%
7 145
 
5.8%
8 130
 
5.2%
Other values (2) 117
 
4.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2483
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
. 587
23.6%
1 371
14.9%
2 229
 
9.2%
3 213
 
8.6%
4 188
 
7.6%
5 177
 
7.1%
6 164
 
6.6%
0 162
 
6.5%
7 145
 
5.8%
8 130
 
5.2%
Other values (2) 117
 
4.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2483
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
. 587
23.6%
1 371
14.9%
2 229
 
9.2%
3 213
 
8.6%
4 188
 
7.6%
5 177
 
7.1%
6 164
 
6.6%
0 162
 
6.5%
7 145
 
5.8%
8 130
 
5.2%
Other values (2) 117
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2483
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
. 587
23.6%
1 371
14.9%
2 229
 
9.2%
3 213
 
8.6%
4 188
 
7.6%
5 177
 
7.1%
6 164
 
6.6%
0 162
 
6.5%
7 145
 
5.8%
8 130
 
5.2%
Other values (2) 117
 
4.7%

Oscar
Text

Missing 

Distinct12
Distinct (%)75.0%
Missing640
Missing (%)97.6%
Memory size21.3 KiB
2025-08-25T12:54:22.087553image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length60
Median length36.5
Mean length23.375
Min length8

Characters and Unicode

Total characters374
Distinct characters25
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)56.2%

Sample

1st rowBest Actress
2nd rowSup. Actor, Sup. Actress
3rd rowBest Picture, Director, Actor, Orig. Screenplay
4th rowOriginal Screenplay
5th rowBest Picture, Director, Supporting Actor, Adapted Screenplay
ValueCountFrequency (%)
best 8
16.7%
actor 7
14.6%
screenplay 6
12.5%
actress 4
8.3%
supporting 4
8.3%
picture 4
8.3%
director 4
8.3%
animated 3
 
6.2%
original 3
 
6.2%
sup 2
 
4.2%
Other values (2) 3
 
6.2%
2025-08-25T12:54:22.155702image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
r 37
 
9.9%
e 37
 
9.9%
t 36
 
9.6%
32
 
8.6%
c 25
 
6.7%
i 22
 
5.9%
p 18
 
4.8%
s 16
 
4.3%
n 16
 
4.3%
A 16
 
4.3%
Other values (15) 119
31.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 374
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
r 37
 
9.9%
e 37
 
9.9%
t 36
 
9.6%
32
 
8.6%
c 25
 
6.7%
i 22
 
5.9%
p 18
 
4.8%
s 16
 
4.3%
n 16
 
4.3%
A 16
 
4.3%
Other values (15) 119
31.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 374
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
r 37
 
9.9%
e 37
 
9.9%
t 36
 
9.6%
32
 
8.6%
c 25
 
6.7%
i 22
 
5.9%
p 18
 
4.8%
s 16
 
4.3%
n 16
 
4.3%
A 16
 
4.3%
Other values (15) 119
31.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 374
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
r 37
 
9.9%
e 37
 
9.9%
t 36
 
9.6%
32
 
8.6%
c 25
 
6.7%
i 22
 
5.9%
p 18
 
4.8%
s 16
 
4.3%
n 16
 
4.3%
A 16
 
4.3%
Other values (15) 119
31.8%

Bafta
Text

Missing 

Distinct9
Distinct (%)75.0%
Missing644
Missing (%)98.2%
Memory size21.0 KiB
2025-08-25T12:54:22.199447image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length39
Median length26
Mean length18.666667
Min length8

Characters and Unicode

Total characters224
Distinct characters25
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)58.3%

Sample

1st rowOriginal Screenplay
2nd rowSupporting Actor, Director
3rd rowAnimated
4th rowSupporting Actress
5th rowLeading Actor
ValueCountFrequency (%)
supporting 4
14.8%
actor 4
14.8%
screenplay 4
14.8%
animated 3
11.1%
director 3
11.1%
original 2
7.4%
film 2
7.4%
adapted 2
7.4%
actress 1
 
3.7%
leading 1
 
3.7%
2025-08-25T12:54:22.268492image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
r 21
 
9.4%
e 19
 
8.5%
t 18
 
8.0%
i 17
 
7.6%
15
 
6.7%
p 14
 
6.2%
n 14
 
6.2%
c 12
 
5.4%
a 12
 
5.4%
o 11
 
4.9%
Other values (15) 71
31.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 224
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
r 21
 
9.4%
e 19
 
8.5%
t 18
 
8.0%
i 17
 
7.6%
15
 
6.7%
p 14
 
6.2%
n 14
 
6.2%
c 12
 
5.4%
a 12
 
5.4%
o 11
 
4.9%
Other values (15) 71
31.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 224
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
r 21
 
9.4%
e 19
 
8.5%
t 18
 
8.0%
i 17
 
7.6%
15
 
6.7%
p 14
 
6.2%
n 14
 
6.2%
c 12
 
5.4%
a 12
 
5.4%
o 11
 
4.9%
Other values (15) 71
31.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 224
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
r 21
 
9.4%
e 19
 
8.5%
t 18
 
8.0%
i 17
 
7.6%
15
 
6.7%
p 14
 
6.2%
n 14
 
6.2%
c 12
 
5.4%
a 12
 
5.4%
o 11
 
4.9%
Other values (15) 71
31.7%

Source
Text

Missing 

Distinct223
Distinct (%)73.1%
Missing351
Missing (%)53.5%
Memory size41.3 KiB
2025-08-25T12:54:22.351259image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length140
Median length107
Mean length52.367213
Min length24

Characters and Unicode

Total characters15972
Distinct characters76
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique218 ?
Unique (%)71.5%

Sample

1st rowhttp://boxofficemojo.com/movies/?id=127hours.htm
2nd rowhttp://www.the-numbers.com/movies/2009/ABSTV.php
3rd rowhttp://www.wikipedia.org
4th rowhttp://boxofficemojo.com/movies
5th rowhttp://boxofficemojo.com/movies
ValueCountFrequency (%)
http://www.the-numbers.com/movies/records/allbudgets.php 66
 
20.3%
http://boxofficemojo.com/movies 13
 
4.0%
http://www.the-numbers.com/movies/2009/abstv.php 4
 
1.2%
http://latimesblogs.latimes.com/entertainmentnewsbuzz/2011/10/movie-projector-real-steel-ides-of-march.html 2
 
0.6%
http://latimesblogs.latimes.com/entertainmentnewsbuzz/2011/11/muppets-arthur-christmas-hugo-box-office.html 2
 
0.6%
2
 
0.6%
unofficial 1
 
0.3%
for 1
 
0.3%
links 1
 
0.3%
see 1
 
0.3%
Other values (232) 232
71.4%
2025-08-25T12:54:22.462995image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 1510
 
9.5%
/ 1327
 
8.3%
t 1149
 
7.2%
e 1118
 
7.0%
m 1111
 
7.0%
i 830
 
5.2%
h 790
 
4.9%
. 720
 
4.5%
s 699
 
4.4%
p 602
 
3.8%
Other values (66) 6116
38.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 15972
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 1510
 
9.5%
/ 1327
 
8.3%
t 1149
 
7.2%
e 1118
 
7.0%
m 1111
 
7.0%
i 830
 
5.2%
h 790
 
4.9%
. 720
 
4.5%
s 699
 
4.4%
p 602
 
3.8%
Other values (66) 6116
38.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 15972
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 1510
 
9.5%
/ 1327
 
8.3%
t 1149
 
7.2%
e 1118
 
7.0%
m 1111
 
7.0%
i 830
 
5.2%
h 790
 
4.9%
. 720
 
4.5%
s 699
 
4.4%
p 602
 
3.8%
Other values (66) 6116
38.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 15972
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 1510
 
9.5%
/ 1327
 
8.3%
t 1149
 
7.2%
e 1118
 
7.0%
m 1111
 
7.0%
i 830
 
5.2%
h 790
 
4.9%
. 720
 
4.5%
s 699
 
4.4%
p 602
 
3.8%
Other values (66) 6116
38.3%

Column
URL

Constant  Missing 

Distinct1
Distinct (%)100.0%
Missing655
Missing (%)99.8%
Memory size20.7 KiB
http://www.shadowandact.com/?p=23430
 
1
(Missing)
655 
ValueCountFrequency (%)
http://www.shadowandact.com/?p=23430 1
 
0.2%
(Missing) 655
99.8%
ValueCountFrequency (%)
http 1
 
0.2%
(Missing) 655
99.8%
ValueCountFrequency (%)
www.shadowandact.com 1
 
0.2%
(Missing) 655
99.8%
ValueCountFrequency (%)
/ 1
 
0.2%
(Missing) 655
99.8%
ValueCountFrequency (%)
p=23430 1
 
0.2%
(Missing) 655
99.8%
ValueCountFrequency (%)
1
 
0.2%
(Missing) 655
99.8%

Interactions

2025-08-25T12:54:19.333569image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:17.860565image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.157908image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.347327image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.546714image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.767761image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.965124image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.148797image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.356426image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:17.916095image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.180974image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.373818image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.608618image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.790854image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.987532image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.170932image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.379231image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:17.963728image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.202838image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.397641image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.630609image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.815406image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.011442image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.194477image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.447043image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:17.996488image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.228326image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.423078image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.653589image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.840659image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.035320image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.218237image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.470185image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.040542image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.249862image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.446974image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.675036image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.865473image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.058489image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.241918image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.494019image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.083304image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.274614image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.473596image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.700498image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.891698image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.081193image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.265836image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.515787image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.107051image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.297992image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.497499image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.723395image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.915228image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.104116image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.288945image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.598583image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.134448image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.323225image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.523041image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.745679image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:18.940290image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.126043image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-08-25T12:54:19.311356image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2025-08-25T12:54:22.482263image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Audience ScoreBox Office Average per CinemaBudgetDomestic GrossGenreLead StudioProfitRotten TomatoesStoryWorldwide Grossidyear
Audience Score1.0000.4350.1620.4070.0930.0240.3970.6700.1230.369-0.0180.082
Box Office Average per Cinema0.4351.0000.3950.7660.1080.0000.6980.2820.0670.703-0.0410.000
Budget0.1620.3951.0000.6500.1490.1410.422-0.0030.1280.709-0.0180.038
Domestic Gross0.4070.7660.6501.0000.1240.0000.8660.1600.1290.938-0.0360.000
Genre0.0930.1080.1490.1241.0000.0480.1350.0910.3500.1720.1130.151
Lead Studio0.0240.0000.1410.0000.0481.0000.2080.0480.0810.2130.1960.347
Profit0.3970.6980.4220.8660.1350.2081.0000.1790.1160.910-0.0610.000
Rotten Tomatoes0.6700.282-0.0030.1600.0910.0480.1791.0000.0610.130-0.0640.033
Story0.1230.0670.1280.1290.3500.0810.1160.0611.0000.1330.0800.115
Worldwide Gross0.3690.7030.7090.9380.1720.2130.9100.1300.1331.000-0.0540.047
id-0.018-0.041-0.018-0.0360.1130.196-0.061-0.0640.080-0.0541.0000.901
year0.0820.0000.0380.0000.1510.3470.0000.0330.1150.0470.9011.000

Missing values

2025-08-25T12:54:19.641757image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2025-08-25T12:54:19.690490image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2025-08-25T12:54:19.766649image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

movies_ididyearexcludeFilmLead StudioRotten TomatoesAudience ScoreStoryGenreNumber of Theatres in Opening Weekend (US)Box Office Average per CinemaDomestic GrossForeign GrossWorldwide GrossBudgetProfitProftitabilityOpening WeekendOscarBaftaSourceColumn
0movies-000012010NaN127 HoursIndependent93.084EscapeAdventure9162333.018.3342.460.7318.042.73337.39%0.26NaNNaNhttp://boxofficemojo.com/movies/?id=127hours.htmNaN
1movies-000122010NaNA Nightmare on Elm StreetWarner Bros.13.040Monster ForceHorror33329875.063.0852.59115.6635.080.66330.46%32.9NaNNaNNaNNaN
2movies-000232010NaNAlice in WonderlandDisney52.072Journey And ReturnAdventure372831143.0334.19690.21024.39200.0824.39512.20%116.1NaNNaNNaNNaN
3movies-000342010NaNAll About SteveIndependent6.035ComedyComedy22514994.033.866.2640.1315.025.13267.53%11.2NaNNaNhttp://www.the-numbers.com/movies/2009/ABSTV.phpNaN
4movies-000452010yAll Good ThingsIndependent33.064The RiddleDrama2NaN0.580.0620.6420.0-19.363.20%0.037NaNNaNhttp://www.wikipedia.orgNaN
5movies-000562010NaNAlpha and OmegaCrest17.041Journey And ReturnAnimation26253469.025.124.829.9120.09.91149.55%9.1NaNNaNNaNNaN
6movies-000672010yBarry MundayIndependent43.043MaturationComedyNaNNaNNaNNaNNaNNaN0.00NaNNaNNaNNaNNaNNaN
7movies-000782010NaNBlack SwanFox88.086Wretched ExcessDrama9598742.0106.95222.44329.3913.0316.392533.77%8.38Best ActressNaNNaNNaN
8movies-000892010NaNBrooklyn's FinestIndependent42.047TemptationAction19366896.027.209.1536.3117.019.31213.59%13.4NaNNaNhttp://boxofficemojo.com/moviesNaN
9movies-0009102010NaNBuriedIndependent86.063EscapeDrama119115.01.0417.3318.382.016.38919.00%0.103NaNNaNhttp://boxofficemojo.com/moviesNaN
movies_ididyearexcludeFilmLead StudioRotten TomatoesAudience ScoreStoryGenreNumber of Theatres in Opening Weekend (US)Box Office Average per CinemaDomestic GrossForeign GrossWorldwide GrossBudgetProfitProftitabilityOpening WeekendOscarBaftaSourceColumn
646movies-06466472009NaNUpDisney98.086Journey And ReturnAnimation376618085.0293.00434727.10175.0552.104.1568.1AnimatedAnimatedhttp://www.the-numbers.com/movies/2009/UP.phpNaN
647movies-06476482009NaNUp in the AirParamount90.076MaturationDrama18955947.083.8278.2162.0225.0137.026.4811.2NaNAdapted Screenplayhttp://boxofficemojo.com/movies/?id=upintheair.htmNaN
648movies-06486492009NaNWatchmenWarner Bros.64.068SacrificeAction361115291.0107.5077.7185.30138.047.301.3455.2NaNNaNhttp://www.the-numbers.com/movies/records/allbudgets.phpNaN
649movies-06496502009yWhatever WorksNaN50.063DiscoveryComedy929574.05.3023.729.0015.014.001.930.26NaNNaNhttp://en.wikipedia.org/wiki/Whatever_WorksNaN
650movies-06506512009NaNWhere the Wild Things AreWarner Bros.73.059Journey And ReturnAdventure37358754.063.4016.385.30100.0-14.700.8532.7NaNNaNhttp://www.the-numbers.com/movies/records/allbudgets.phpNaN
651movies-06516522009yWhip ItNaN84.073MaturationDrama17212702.013.00316.0015.01.001.074.7NaNNaNhttp://www.boxofficemojo.com/movies/?id=whipit.htmNaN
652movies-06526532009yWhiteoutNaN7.028PursuitAction27451791.010.301.912.2035.0-22.800.354.9NaNNaNhttp://www.the-numbers.com/movies/records/allbudgets.phpNaN
653movies-06536542009NaNX-Men Origins: WolverineFox37.072RevengeAction409920751.0179.90193.2373.10150.0223.102.4985.1NaNNaNhttp://www.the-numbers.com/movies/records/allbudgets.phpNaN
654movies-06546552009yYear OneNaN14.031QuestAdventure30226489.032.4026.260.2060.00.20119.6NaNNaNhttp://www.the-numbers.com/movies/records/allbudgets.phpNaN
655movies-06556562009NaNZombielandSony90.087Monster ForceAction30368147.049.2042.593.3023.669.703.9524.7NaNNaNhttp://www.the-numbers.com/movies/records/allbudgets.phpNaN